WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 



PCX 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 : 
G01N 15/14 



A2 



(11) International Publication Number: 
(43) International Publication Date: 



WO 97/20198 



5 June 1997 (05.06.97) 



(21) International Application Number: 



PCT/US96719166 



(22) International Filing Date: 27 November 1996 (27.1 1.96) 



(30) Priority Data: 

60/026,805 



30 November 1 995 (30. 1 1 .95) US 



(71) Applicant: XL VISION, INC. [US/US]; 10305 102nd Terrace, 

Sebastian, FL 32958 (US). 

(72) Inventors: DOUGLASS, James, W.; 320 Miami Avenue, 

Indialantic, FL 32903 (US). RIDING, Thomas, J.; 663 
Linvitle Falls Drive, West Melbourne. FL 32904 (US). 
RING, James, E.; 1011 Sunswept Road, N.E., Palm Bay, 
FL 32905 (US). 

(74) Agents: LOCKMAN, David, M. et ah; Morris, Manning 
& Martin, L.L.P., 1600 Atlanta Financial Center, 3343 
Peachtree Road, N.E., Atlanta, GA 30326 (US). 



(81) Designated States: AL, AM. AT, AU, AZ, BA, BB, BG, BR, 
BY, CA, CH, CN, CU, CZ, DE, DK, EE, ES, FI, GB. GE, 
HU, IL, IS, JP, KE, KG, KP, KR, KZ, LC, LK, LR, LS, 
LT, LU, LV, MD, MG, MK, MN, MW, MX, NO. NZ, PL, 
PT, RO, RU. SD, SE, SG, SI, SK. TJ, TM, TR, TT, UA, 
UG, UZ, VN, ARIPO patent (KE, LS, MW, SD, SZ, UG), 
Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), 
European patent (AT, BE, CH. DE, DK, ES, FI, FR, GB, 
GR, IE, IT, LU, MC. NL, PT, SE), OAPI patent (BF, BJ, 
CF, CG, CI, CM, GA, GN, ML, MR, NE, SN, TD, TG). 



Published 

Without international search report and to be republished 
upon receipt of that report. 



(54) Tide: METHOD AND APPARATUS FOR AUTOMATED IMAGE ANALYSIS OF BIOLOGICAL SPECIMENS 



(57) Abstract 

A method and apparatus for automated cell analysis of biological specimens 
automatically scans at a low magnification to acquire images (288) which are analyzed 
to determine candidate cell objects of interest. The low magnification images are converted 
from a first color space to a second color space (290). The color space converted image 
is then low pass filtered (292) and compared to a threshold (294) to remove artifacts and 
background objects from the candidate object of interest pixels of the color converted 
image. The candidate object of interest pixels are morphologically processed (296) to 
group candidate object of interest pixels together into groups which are compared to blob 
parameters (298) to identify candidate objects of interest which correspond to cells or other 
structures relevant to medical diagnosis of the biological specimen. The location coordinates 
of the objects of interest are stored and additional images of the candidate cell objets are 
acquired at high magnification. The high magnification images are analyzed in the same 
manner as the low magnification images to confirm the candidate objects of interest which 
are objects of interest. A high magnification image of each confirmed object of interest is 
stored for later review and evaluation by a pathologist. 
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METHOD AND APPARATUS FOR AUTOMATED 
IMAGE ANALYSIS OF BIOLOGICAL SPECIMENS 

Background of the Invention 

In the field of medical diagnostics including oncology, the detection, identification, 
quantitation and characterization of cells of interest, such as cancer cells, through testing of 
biological specimens is an important aspect of diagnosis. Typically, a biological specimen such as 
bone marrow, lymph nodes, peripheral blood, cerebrospinal fluid, urine, effusions, fine needle 
aspirates, peripheral blood scrapings or other materials are prepared by staining the specimen to 
identify cells of interest. One method of cell specimen preparation is to react a specimen with a 
specific probe which can be a monoclonal antibody, a polyclonal antiserum, or a nucleic acid which 
is reactive with a component of the cells of interest, such as tumor cells. The reaction may be 
detected using an enzymatic reaction, such as alkaline phosphatase or glucose oxidase or peroxidase 
to convert a soluble colorless substrate to a colored insoluble precipitate, or by directly conjugating a 
1 5 dye to the probe. 

Examination of biological specimens in the past has been performed manually by either a lab 
technician or a pathologist. In the manual method, a slide prepared with a biological specimen is 
viewed at a low magnification under a microscope to visually locate candidate cells of interest. 
Those areas of the slide where cells of interest are located are then viewed at a higher magnification 

20 to confirm those objects as cells of interest, such as tumor or cancer cells. The manual method is 
time consuming and prone to error including missing areas of the slide. 

Automated cell analysis systems have been developed to improve the speed and accuracy of 
the testing process. One known interactive system includes a single high power microscope 
objective for scanning a rack of slides, portions of which have been previously identified for assay 

25 by an operator. In that system, the operator first scans each slide at a low magnification similar to 
the manual method and notes the points of interest on the slide for later analysis. The operator then 
stores the address of the noted location and the associated function in a data file. Once the points of 
interest have been located and stored by the operator, the slide is then positioned in an automated 
analysis apparatus which acquires images of the slide at the marked points and performs an image 

30 analysis. 

Summary of the Imfinjjpj 

A problem with the foregoing automated system is the continued need for operator input to 
initially locate cell objects for analysis. Such continued dependence on manual input can lead to 
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errors including cells of interest being missed. Such errors can be critical especially in assays for so- 
called rare events, e.g., finding one tumor cell in a cell population of one million normal ceils. 
Additionally, manual methods can be extremely time consuming and can require a high degree of 
training to properly identify and/or quantify cells. This is not only true for tumor cell detection, but 

• 5 also for other applications ranging from neutrophil alkaline phosphatase assays, reticulocyte 
counting and maturation assessment, and others. The associated manual labor leads to a high cost 
for these procedures in addition to the potential errors that can arise from long, tedious manual 
examinations. A need exists, therefore, for an improved automated cell analysis system which can 
quickly and accurately scan large amounts of biological material on a slide. Accordingly, the 

10 present invention provides a method and apparatus for automated cell analysis which eliminates the 
need for operator input to locate cell objects for analysis. 

In accordance with the present invention, a slide prepared with a biological specimen and 
reagent is placed in a slide carrier which preferably holds four slides. The slide carriers are loaded 
into an input hopper of the automated system. The operator may then enter data identifying the size, 

15 shape and location of a scan area on each slide, or, preferably, the system automatically locates a 
scan area for each slide during slide processing. The operator then activates the system for slide 
processing. At system activation, a slide carrier is positioned on an X-Y stage of an optical system. 
Any bar codes used to identify slides are then read and stored for each slide in a carrier. The entire 
slide is rapidly scanned at a low magnification, typically lOx. At each location of the scan, a low 

20 magnification image is acquired and processed to detect candidate objects of interest. Preferably, 
color, size and shape are used to identify objects of interest. The location of each candidate object of 
interest is stored. 

At the completion of the low level scan for each slide in the carrier on the stage, the optical 
system is adjusted to a high magnification such as 40x or 60x, and the X-Y stage is positioned to the 

25 stored locations for the candidate objects of interest on each slide in the carrier. A high 
magnification image is acquired for each candidate object of interest and a series of image 
processing steps are performed to confirm the analysis which was performed at low magnification. 
A high magnification image is stored for each confirmed object of interest. These images are then 
available for retrieval by a pathologist or cytotechnologist to review for final diagnostic evaluation. 

30 Having stored the location of each object of interest, a mosaic comprised of the candidate objects of 
interest for a slide may be generated and stored. The pathologist or cytotechnologist may view the 
mosaic or may also directly view the slide at the location of an object of interest in the mosaic for 
further evaluation. The mosaic may be stored on magnetic media for future reference or may be 
transmitted to a remote site for review and/or storage. The entire process involved in examining a 
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single slide takes on the order of 2-15 minutes depending on scan area size and the number of 
detected candidate objects of interest. 

The present invention has utility in the field of oncology for the early detection of minimal 
residual disease ("micrometastases"). Other useful applications include prenatal diagnosis of fetal 
5 cells in maternal blood and in the field of infectious diseases to identify pathogens and viral loads, 
alkaline phosphatase assessments, reticulocyte counting, and others. 

The processing of images acquired in the automated scanning of the present invention 
preferably includes the steps of transforming the image to a different color space; filtering the 
transformed image with a low pass filter; dynamically thresholding the pixels of the filtered image to 
10 suppress background material; performing a morphological function to remove artifacts from the 
thresholded image; analyzing the thresholded image to determine the presence of one or more 
regions of connected pixels having the same color; and categorizing every region having a size 
greater than a minimum size as a candidate object of interest 

According to another aspect of the invention, the scan area is automatically determined by 
15 scanning the slide; acquiring an image at each slide position; analyzing texture information of each 
image to detect the edges of the specimen; and storing the locations corresponding to the detected 
edges to define the scan area. 

According to yet another aspect of the invention, automated focusing of the optical system is 
achieved by initially determining a focal plane from an array of points or locations in the scan area. 

20 The derived focal plane enables subsequent rapid automatic focusing in the low power scanning 
operation. The focal plane is determined by determining proper focal positions across an array of 
locations and performing an analysis such as a least squares fit of the array of focal positions to yield 
a focal plane across the array. Preferably, a focal position at each location is determined by 
incrementing the position of a Z stage for a fixed number of coarse and fine iterations. At each 

25 iteration, an image is acquired and a pixel variance or other optical parameter about a pixel mean for 
the acquired image is calculated to form a set of variance data. A least squares fit is performed on 
the variance data according to a known function. The peak value of the least squares fit curve is 
selected as an estimate of the best focal position, 

In another aspect of the present invention, another focal position method for high 

30 magnification locates a region of interest centered about a candidate object of interest within a slide 
which were located during an analysis of the low magnification images. The region of interest is 
preferably n columns wide, where n is a power of 2. The pixels of this region are then processed 
using a Fast Fourier Transform to generate a spectra of component frequencies and corresponding 
complex magnitude for each frequency component. Preferably, the complex magnitude of the 
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frequency components which range from 25% to 75% of the maximum frequency component are 

squared and summed to obtain the total power for the region of interest. This process is repeated for 
other Z positions and the Z position corresponding to the maximum total power for the region of 
interest is selected as the best focal position. This process is preferably used to select a Z position 
for regions of interest for slides containing neutrophils stained with Fast Red to identify alkaline 
phosphatase in cell cytoplasm and counterstained with hemotoxylin to identify the nucleus of the 
neutrophil cell. This focal method may be used with other stains and types of biological specimens, 
as well. 

According to still another aspect of the invention, a method and apparatus for automated 
slide handling is provided. A slide is mounted onto a slide carrier with a number of other slides 
side-by-side. The slide carrier is positioned in an input feeder with other slide carriers to facilitate 
automatic analysis of a batch of slides. The slide carrier is loaded onto the X-Y stage of the optical 
system for the analysis of the slides thereon. Subsequently, the first slide carrier is unloaded into an 
output feeder after automatic image analysis and the next carrier is automatically loaded. 
Brief Description of the Drawings 

The above and other features of the invention including various novel details of construction 
and combinations of parts will now be more particularly described with reference to the 
accompanying drawings and pointed out in the claims. It will be understood that the particular 
apparatus embodying the invention is shown by way of illustration only and not as a limitation of the 
invention. The principles and features of this invention may be employed in varied and numerous 
embodiments without departing from the scope of the invention. 

Fig. 1 is a perspective view of an apparatus for automated cell analysis embodying the 
present invention. 

Fig. 2 is a block diagram of the apparatus shown in Fig. 1 . 

Fig. 3 is a block diagram of the microscope controller of Fig. 2. 

Fig. 4 is a plan view of the apparatus of Fig. 1 having the housing removed. 

Fig. 5 is a side view of a microscope subsystem of the apparatus of Fig. 1 . 

Fig. 6a is a top view of a slide carrier for use with the apparatus of Fig. 1 . 

Fig. 6b is a bottom view of the slide carrier of Fig. 6a. 

Fig. 7a is a top view of an automated slide handling subsystem of the apparatus of Fig. 1 . 
Fig. 7b is a partial cross-sectional view of the automated slide handling subsystem of Fig. 7a 
taken on Iine^-y4. 

Fig. 8 is an end view of the input module of the automated slide handling subsystem. 
Figs. 8a-8d illustrate the input operation of the automatic slide handling subsystem. 
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Figs. 9a-9d illustrate the output operation of the automated slide handling subsystem. 

Fig. 10 is a flow diagram of the procedure for automatically determining a scan area. 

Fig. 1 1 shows the scan path on a prepared slide in the procedure of Fig. 10. 

Fig. 12 illustrates an image of a field acquired in the procedure of Fig. 10. 
5 Fig. 1 3 A is a flow diagram of a preferred procedure for determining a focal position. 

Fig. 13B is a flow diagram of a preferred procedure for determining a focal position for 
neutrophils stained with Fast Red and counterstained with hemotoxylin. 

Fig. 14 is a flow diagram of a procedure for automatically determining initial focus. 

Fig. 15 shows an array of slide positions for use in the procedure of Fig. 14. 
10 Fig. 16 is a flow diagram of a procedure for automatic focusing at a high magnification. 

Fig. 17A is a flow diagram of an overview of the preferred process to locate and identify 
objects of interest in a stained biological specimen on a slide. 

Fig. 17B is a flow diagram of a procedure for color space conversion. 

Fig. 18 is a flow diagram of a procedure for background suppression via dynamic 
15 thresholding. 

Fig. 19 is a flow diagram of a procedure for morphological processing. 

Fig. 20 is a flow diagram of a procedure for blob analysis. 

Fig. 21 is a flow diagram of a procedure for image processing at a high magnification. 
Fig. 22 illustrates a mosaic of cell images produced by the apparatus. 
20 Fig. 23 is a flow diagram of a procedure for estimating the number of nucleated cells in a 

scan area. 

Fig. 24 illustrates the apparatus functions available in a user interface of the apparatus. 

Detailed Description of the Pre ferred Emhodiment 

Referring now to the figures, an apparatus for automated cell analysis of biological 
25 specimens is generally indicated by reference numeral 10 as shown in perspective view in Fig. 1 and 
in block diagram form in Fig. 2. The apparatus 10 comprises a microscope subsystem 32 housed in 
a housing 12. The housing 12 includes a slide carrier input hopper 16 and a slide carrier output 
hopper 18. A door 14 in the housing 12 secures the microscope subsystem from the external 
environment. A computer subsystem comprises a computer 22 having a system processor 23, an 
30 image processor 25 and a communications modem 29. The computer subsystem further includes a 
computer monitor 26 and an image monitor 27 and other external peripherals including storage 
device 21, track ball device 30, keyboard 28 and color printer 35. An external power supply 24 is 
also shown for powering the system. Viewing oculars 20 of the microscope subsystem project from 
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the housing 12 for operator viewing. The apparatus 10 further includes a CCD camera 42 for 
acquiring images through the microscope subsystem 32. A microscope controller 31 under the 
control of system processor 23 controls a number of microscope-subsystem functions described 
further in detail. An automatic slide feed mechanism 37 in conjunction with X-Y stage 38 provide 
5 automatic slide handling in the apparatus 10. An illumination light source 48 projects light onto the 
X-Y stage 38 which is subsequently imaged through the microscope subsystem 32 and acquired 
through CCD camera 42 for processing in the image processor 25. A Z stage or focus stage 46 under 
control of the microscope controller 31 provides displacement of the microscope subsystem in the Z 
plane for focusing. The microscope subsystem 32 further includes a motorized objective turret 44 
1 0 for selection of objectives. 

The purpose of the apparatus 10 is for the unattended automatic scanning of prepared 
microscope slides for the detection and counting of candidate objects of interest such as normal and 
abnormal cells, e.g., tumor cells. The preferred embodiment may be utilized for rare event detection 
in which there may be only one candidate object of interest per several hundred thousand normal 

15 cells, e.g., one to five candidate objects of interest per 2 square centimeter area of the slide. The 
apparatus 10 automatically locates and counts candidate objects of interest and estimates normal 
cells present in a biological specimen on the basis of color, size and shape characteristics. A number 
of stains are used to preferentially stain candidate objects of interest and normal cells different colors 
so that such cells can be distinguished from each other. 

20 As noted in the background of the invention, a biological specimen may be prepared with a 

reagent to obtain a colored insoluble precipitate. The apparatus of the present invention is used to 
detect this precipitate as a candidate object of interest. 

During operation of the apparatus 10, a pathologist or laboratory technician mounts prepared 
slides onto slide carriers. A slide carrier 60 is illustrated in Fig. 8 and will be described further 

25 below. Each slide carrier holds up to 4 slides. Up to 25 slide carriers are then loaded « into input 
hopper 16. The operator can specify the size, shape and location of the area to be scanned or 
alternatively, the system can automatically locate this area. The operator then commands the system 
to begin automated scanning of the slides through a graphical user interface. Unattended scanning 
begins with the automatic loading of the first carrier and slide onto the precision motorized X-Y 

30 stage 38. A bar code label affixed to the slide is read by a bar code reader 33 during this loading 
operation. Each slide is then scanned at a user selected low microscope magnification, for example, 
lOx, to identify candidate cells based on their color, size and shape characteristics. The X-Y 
locations of candidate cells are stored until scanning is completed. 
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After the low magnification scanning is completed, the apparatus automatically returns to 
each candidate cell, reimages and refocuses at a higher magnification such as 40x and performs 
further analysis to confirm the cell candidate. The apparatus stores an image of the cell for later 
review by a pathologist. All results and images can be stored to a storage device 21 such as a 

• 5 removable hard drive or DAT tape or transmitted to a remote site for review or storage. The stored 
images for each slide can be viewed in a mosaic of images for further review. In addition, the 
pathologist or operator can also directly view a detected cell through the microscope using the 
included oculars 20 or on image monitor 27. 

Having described the overall operation of the apparatus 10 from a high level, the further 

10 details of the apparatus will now be described. Referring to Fig. 3, the microscope controller 31 is 
shown in more detail. The microscope controller 31 includes a number of subsystems connected 
through a system bus. A system processor 102 controls these subsystems and is controlled by the 
apparatus system processor 23 through an RS 232 controller 110. The system processor 102 controls 
a set of motor - control subsystems 1 14 through 124 which control the input and output feeder, the 

15 motorized turret 44, the X-Y stage 38, and the Z stage 46 (Fig. 2). A histogram processor 108 
receives input from CCD camera 42 for computing variance data during the focusing operation 
described further herein. 

The system processor 102 further controls an illumination controller 106 for control of 
substage illumination 48. The light output from the halogen light bulb which supplies illumination 

20 for the system can vary over time due to bulb aging, changes in optical alignment, and other factors. 
In addition, slides which have been "over stained" can reduce the camera exposure to an 
unacceptable level. In order to compensate for these effects, the illumination controller 106 is 
included. This controller is used in conjunction with light control software to compensate for the 
variations in light level. The light control software samples the output from the camera at intervals 

25 (such as between loading of slide carriers), and commands the controller to adjust the light level to 
the desired levels. In this way, light control is automatic and transparent to the user and adds no 
additional time to system operation. 

The system processor 23 is preferably comprised of dual parallel Intel Pentium 90 MHz 
devices. The image processor 25 is preferably a Matrox Imaging Series 640 model. The microscope 

30 controller system processor 1 02 is an Advanced Micro Devices AMD29K device. 

Referring now to Figs. 4 and 5, further detail of the apparatus 10 is shown. Fig. 4 shows a 
plan view of the apparatus 10 with the housing 12 removed. A portion of the automatic slide feed 
mechanism 37 is shown to the left of the microscope subsystem 32 and includes slide carrier 
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unloading assembly 34 and unloading platform 36 which in conjunction with slide carrier output 
hopper 18 function to receive slide carriers which have been analyzed. 

Vibration isolation mounts 40, shown in further detail in Fig. 5, are provided to isolate the 
microscope subsystem 32 from mechanical shock and vibration that can occur in a typical laboratory 
environment. In addition to external sources of vibration, the high speed operation of the X-Y stage 
38 can induce vibration into the microscope subsystem 32. Such sources of vibration can be isolated 
from the electro-optical subsystems to avoid any undesirable effects on image quality. The isolation 
mounts 40 comprise a spring 40a and piston 40b submerged in a high viscosity silicon gel which is 
enclosed in an elastomer membrane bonded to a casing to achieve damping factors on the order of 
1 7 to 20%. 

The automatic slide handling feature of the present invention will now be described. The 
automated slide handling subsystem operates on a single slide carrier at a time. A slide carrier 60 is 
shown in Figs. 6a and 6b which provide a top view and a bottom view respectively. The slide carrier 
60 includes up to four slides 70 mounted with adhesive tape 62. The carrier 60 includes ears 64 for 
hanging the carrier in the output hopper 18. An undercut 66 and pitch rack 68 are formed at the top 
edge of the slide carrier 60 for mechanical handling of the slide carrier. A keyway cutout 65 is 
formed in one side of the carrier 60 to facilitate carrier alignment. A prepared slide 72 mounted on 
the slide carrier 60 includes a sample area 72a and a bar code label area 72b. 

Fig. 7a provides a top view of the slide handling subsystem which comprises a slide input 
module 15, a slide output module 17 and X-Y stage drive belt 50. Fig. 7b provides a partial cross- 
sectional view taken along line A-A of Fig. 7a. 

The slide input module 15 comprises a slide carrier input hopper 16, loading platform 52 and 
slide carrier loading subassembly 54. The input hopper 1 6 receives a series of slide carriers 60 (Figs. 
6a and 6b) in a stack on loading platform 52. A guide key 57 protrudes from a side of the input 
hopper 16 to which the keyway cutout 65 (Fig. 6a) of the carrier is fit to achieve proper alignment. 

The input module 15 further includes a revolving indexing cam 56 and a switch 90 mounted 
in the loading platform 52, the operation of which is described further below. The carrier loading 
subassembly 54 comprises an infeed drive belt 59 driven by a motor 86. The infeed drive belt 59 
includes a pusher tab 58 for pushing the slide carrier horizontally toward the X-Y stage 38 when the 
belt is driven. A homing switch 95 senses the pusher tab 58 during a revolution of the belt 59. 

Referring specifically to Fig. 7a, the X-Y stage 38 is shown with x position and y position 
motors 96 and 97 respectively which are controlled by the microscope controller 31 (Fig. 3) and are 
not considered part of the slide handling subsystem. The X-Y stage 38 further includes an aperture 
55 for allowing illumination to reach the slide carrier. A switch 91 is mounted adjacent the aperture 
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55 for sensing contact with the carrier and thereupon activating a motor 87 to drive stage drive belt 
50 (Fig. 7b). The drive belt 50 is a double sided timing belt having teeth for engaging pitch rack 68 
of the carrier 60 (Fig. 6b). 

The slide output module 17 includes slide carrier output hopper 18, unloading platform 36 
and slide carrier unloading subassembly 34. The unloading subassembly 34 comprises a motor 89 
for rotating the unloading platform 36 about shaft 98 during an unloading operation described further 
below. An outfeed gear 93 driven by motor 88 rotatably engages the pitch rack 68 of the carrier 60 
(Fig. 6b) to transport the carrier to a rest position against switch 92. A springloaded hold-down 
mechanism holds the carrier in place on the unloading platform 36. 

The slide handling operation will now be described. Referring to Fig. 8, a series of slide 
carriers 60 are shown stacked in input hopper 16 with the top edges 60a aligned. As the slide 
handling operation begins, the indexing cam 56 driven by motor 85 advances one revolution to allow 
only one slide carrier to drop to the bottom of the hopper 16 and onto the loading platform 52. 

Figs. 8a-8d show the cam action in more detail. The cam 56 includes a hub 56a to which are 
mounted upper and lower leaves 56b and 56c respectively. The leaves 56b, 56c are semicircular 
projections oppositely positioned and spaced apart vertically. In a first position shown in Fig. 8a, the 
upper leaf 56b supports the bottom carrier at the undercut portion 66. At a position of the cam 56 
rotated 180°, shown in Fig. 8b, the upper leaf 56b no longer supports the carrier and instead the 
carrier has dropped slightly and is supported by the lower leaf 56c. Fig. 8c shows the position of the 
cam 56 rotated 270° wherein the upper leaf 56b has rotated sufficiently to begin to engage the 
undercut 66 of the next slide carrier while the opposite facing lower leaf 56c still supports the 
bottom carrier. After a full rotation of 360° as shown in Fig. 8d, the lower leaf 56c has rotated 
opposite the carrier stack and no longer supports the bottom carrier which now rests on the loading 
platform 52. At the same position, the upper leaf 56b supports the next carrier for repeating the 
cycle. 

Referring again to Figs. 7a and 7b, when the carrier drops to the loading platform 52, the 
contact closes switch 90 which activates motors 86 and 87. Motor 86 drives the infeed drive belt 59 
until the pusher tab 58 makes contact with the carrier and pushes the carrier onto the X-Y stage drive 
belt 50. The stage drive belt 50 advances the carrier until contact is made with switch 91, the closing 
of which begins the slide scanning process described further herein. Upon completion of the 
scanning process, the X-Y stage 38 moves to an unload position and motors 87 and 88 are activated 
to transport the carrier to the unloading platform 36 using stage drive belt 50. As noted, motor 88 
drives outfeed gear 93 to engage the carrier pitch rack 68 of the carrier 60 (Fig. 6b) until switch 92 is 
contacted. Closing switch 92 activates motor 89 to rotate the unloading platform 36. 
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The unloading operation is shown in more detail in end views of the output module 17 (Figs. 
9a-9d). In Fig. 9a, the unloading platform 36 is shown in a horizontal position supporting a slide 
carrier 60. The hold-down mechanism 94 secures the carrier 60 at one end. Fig. 9b shows the 
output module 17 after motor 89 has rotated the unloading platform 36 to a vertical position, at 
5 which point the spring loaded hold-down mechanism 94 releases the slide carrier 60 into the output 
hopper 1 8. The carrier 60 is supported in the output hopper 1 8 by means of ears 64 (Figs. 6a and 
6b). Fig. 9c shows the unloading platform 36 being rotated back towards the horizontal position. As 
the platform 36 rotates upward, it contacts the deposited carrier 60 and the upward movement pushes 
the carrier toward the front of the output hopper 18. Fig. 9d shows the unloading platform 36 at its 
1 0 original horizontal position after having output a series of slide carriers 60 to the output hopper 1 8. 

Having described the overall system and the automated slide handling feature, the aspects of 
the apparatus 10 relating to scanning, focusing and image processing will now be described in 
further detail. 

In some cases, an operator will know ahead of time where the scan area of interest is on the 
15 slide. Conventional preparation of slides for examination provides repeatable and known placement 
of the sample on the slide. The operator can therefore instruct the system to always scan the same 
area at the same location of every slide which is prepared in this fashion. But there are other times 
in which the area of interest is not known, for example, where slides are prepared manually with a 
known smear technique. One feature of the invention automatically determines the scan area using a 
20 texture analysis process. 

Fig. 1 0 is a flow diagram that describes the processing associated with the automatic location 
of a scan area. As shown in this figure, the basic method is to pre-scan the entire slide area to 
determine texture features that indicate the presence of a smear and to discriminate these areas from 
dirt and other artifacts. 

25 At each location of this raster scan, an image such as in Fig. 12 is acquired and analyzed for 

texture information at steps 204 and 206. Since it is desired to locate the edges of the smear sample 
within a given image, texture analyses are conducted over areas called windows 78, which are 
smaller than the entire image as shown in Fig. 12. The process iterates the scan across the slide at 
steps 208, 210, 212 and 214. 

30 In to* 5 interest of speed, the texture analysis process is performed at a lower magnification, 

preferably at a 4x objective. One reason to operate at low magnification is to image the largest slide 
area at any one time. Since cells do not yet need to be resolved at this stage of the overall image 
analysis, the 4x magnification is preferred. On a typical slide, as shown in Fig. 1 1, a portion 72b of 
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the end of the slide 72 is reserved for labeling with identification information. Excepting this label 
area, the entire slide is scanned in a raster scan fashion 76 to yield a number of adjacent images. 

Texture values for each window include the pixel variance over a window, the difference 
between the largest and smallest pixel value within a window, and other indicators. The presence of 
a smear raises the texture values compared with a blank area. 

One problem with a smear from the standpoint of determining its location is its non-uniform 
thickness and texture. For example, the smear is likely to be relatively thin at the edges and thicker 
towards the middle due to the nature of the smearing process. To accommodate for the 
non-uniformity, texture analysis provides a texture value for each analyzed area. The texture value 
tends to gradually rise as the scan proceeds across a smear from a thin area to a thick area, reaches a 
peak, and then falls off again to a lower value as a thin area at the edge is reached. The problem is 
then to decide from the series of texture values the beginning and ending, or the edges, of the smear. 
The texture values are fit to a square wave waveform since the texture data does not have sharp 
beginnings and endings. 

After conducting this scanning and texture evaluation operation, one must determine which 
areas of elevated texture values represent the desired smear 74, and which represent undesired 
artifacts. This is accomplished by fitting a step function, on a line by line basis to the texture values 
in step 216. This function, which resembles a single square wave across the smear with a beginning 
at one edge, and end at the other edge, and an amplitude provides the means for discrimination. The 
amplitude of the best-fit step function is utilized to determine whether smear or dirt is present since 
relatively high values indicate smear. If it is decided that smear is present, the beginning and ending 
coordinates of this pattern are noted until all lines have been processed, and the smear sample area 
defined at 218. 

After an initial focusing operation described further herein, the scan area of interest is 
scanned to acquire images for image analysis. The preferred method of operation is to initially 
perform a complete scan of the slide at low magnification to identify and locate candidate objects of 
interest, followed by further image analysis of the candidate objects of interest at high magnification 
in order to confirm the objects as cells. An alternate method of operation is to perform high 
magnification image analysis of each candidate object of interest immediately after the object has 
been identified at low magnification. The low magnification scanning then resumes, searching for 
additional candidate objects of interest. Since it takes on the order of a few seconds to change 
objectives, this alternate method of operation would take longer to complete. 

The operator can pre-select a magnification level to be used for the scanning operation. A 
low magnification using a lOx objective is preferred for the scanning operation since a larger area 
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can be initially analyzed for each acquired scan image. The overall detection process for a cell 
includes a combination of decisions made at both low (lOx) and high magnification (40x) levels. 
Decision making at the lOx magnification level is broader in scope, i.e., objects that loosely fit the 
relevant color, size and shape characteristics are identified at the lOx level. Analysis at the 40x 
magnification level then proceeds to refine the decision making and confirm objects as likely cells or 
candidate objects of interest. For example, at the 40x level it is not uncommon to find that some 
objects that were identified at lOx are artifacts which the analysis process will then reject. In 
addition, closely packed objects of interest appearing at lOx are separated at the 40x level. 

In a situation where a cell straddles or overlaps adjacent image fields, image analysis of the 
individual adjacent, image fields could result in the cell being rejected or undetected. To avoid 
missing such cells, the scanning operation compensates by overlapping adjacent image fields in both 
the x and y directions. An overlap amount greater than half the diameter of an average cell is 
preferred. In the preferred embodiment, the overlap is specified as a percentage of the image field in 
the x and y directions. 

15 The time to complete an image analysis can vary depending upon the size of the scan area 

and the number of candidate cells, or objects of interest identified. For one example, in the preferred 
embodiment, a complete image analysis of a scan area of two square centimeters in which 50 objects 
of interest are confirmed can be performed in about 12 to 15 minutes. This example includes not 
only focusing, scanning and image analysis but also the saving of 40x images as a mosaic on hard 

20 drive 21 (Fig. 2). 

Consider the utility of the present invention in a "rare event" application where there may be 
one, two or a very small number of cells of interest located somewhere on the slide. To illustrate the 
nature of the problem by analogy, if one were to scale a slide to the size of a football field, a tumor 
cell, for example, would be about the size of a bottle cap. The problem is then to rapidly search the 
25 football field and find the very small number of bottle caps and have a high certainty that none have 
been missed. 

However the scan area is defined, an initial focusing operation must be performed on each 
slide prior to scanning. This is required since slides differ, in general, in their placement in a carrier. 
These differences include slight (but significant) variations of tilt of the slide in its carrier. Since 
30 each slide must remain in focus during scanning, the degree of tilt of each slide must be determined. 
This is accomplished with an initial focusing operation that determines the exact degree of tilt, so 
that focus can be maintained automatically during scanning. 

The initial focusing operation and other focusing operations to be described later utilize a 
focusing method based on processing of images acquired by the system. This method was chosen 
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for its simplicity over other methods including use of IR beams reflected from the slide surface and 
use of mechanical gauges. These other methods also would not function properly when the 
specimen is protected with a coverglass. The preferred method results in lower system cost and 
improved reliability since no additional parts need be included to perform focusing. 
5 Fig. 13A provides a flow diagram describing the "focus point" procedure. The basic method 

relies on the fact that the pixel value variance (or standard deviation) taken about the pixel value 
mean is maximum at best focus. A "brute-force" method could simply step through focus, using the 
computer controlled Z, or focus stage, calculate the pixel variance at each step, and return to the 
focus position providing the maximum variance. Such a method would be too time consuming. 

10 Therefore, additional features were added as shown in Fig. 13 A. 

These features include the determination of pixel variance at a relatively coarse number of 
focal positions, and then the fitting of a curve to the data to provide a faster means of determining 
optimal focus. This basic process is applied in two steps, coarse and fine. 

During the coarse step at 220-230, the Z stage is stepped over a user-specified range of focus 

15 positions, with step sizes that are also user-specified. It has been found that for coarse focusing, 
these data are a close fit to a Gaussian function. Therefore, this initial set of variance versus focus 
position data are least-squares fit to a Gaussian function at 228. The location of the peak of this 
Gaussian curve determines the initial or coarse estimate of focus position for input to step 232. 

Following this, a second stepping operation 232-242 is performed utilizing smaller steps over 

20 a smaller focus range centered on the coarse focus position. Experience indicates that data taken 
over this smaller range are generally best fit by a second order polynomial. Once this least squares 
fit is performed at 240, the peak of the second order curve provides the fine focus position at 244. 

Fig. 14 illustrates a procedure for how this focusing method is utilized to determine the 
orientation of a slide in its carrier. As shown, focus positions are determined, as described above, for 

25 a 3 x 3 grid of points centered on the scan area at 264. Should one or more of these points lie outside 
the scan area, the method senses at 266 this by virtue of low values of pixel variance. In this case, 
additional points are selected closer to the center of the scan area. Fig. 1 5 shows the initial array of 
points 80 and new point 82 selected closer to the center. Once this array of focus positions is 
determined at 268, a least squares plane is fit to this data at 270. Focus points lying too far above or 

30 below this best-fit plane are discarded at 272 (such as can occur from a dirty cover glass over the 
scan area), and the data is then refit. This plane at 274 then provides the desired Z position 
information for maintaining focus during scanning. 

After determination of the best-fit focus plane, the scan area is scanned in an X raster scan 
over the scan area as described earlier. During scanning, the X stage is positioned to the starting 
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point of the scan area, the focus (Z) stage is positioned to the best fit focus plane, an image is 
acquired and processed as described later, and this process is repeated for all points over the scan 
area. In this way, focus is maintained automatically without the need for time-consuming refocusing 
at points during scanning. 

'5 Prior to confirmation of cell objects at a 40x or 60x level, a refocusing operation is 

conducted since the use of this higher magnification requires more precise focus than the best-fit 
plane provides. Fig. 16 provides the flow diagram for this process. As may be seen, this process is 
similar to the fine focus method described earlier in that the object is to maximize the image pixel 
variance. This is accomplished by stepping through a range of focus positions with the Z stage at 

10 276, 278, calculating the image variance at each position at 278, fitting a second order polynomial to 
these data at 282, and calculating the peak of this curve to yield an estimate of the best focus 
position at 284, 286. This final focusing step differs from previous ones in that the focus range and 
focus step sizes are smaller since this magnification requires focus settings to within 0.5 micron or 
better. 

1 5 . It should be noted that for some combinations of cell staining characteristics, improved focus 

can be obtained by numerically selecting the focus position that provides the largest variance, as 
opposed to selecting the peak of the polynomial. In such cases, the polynomial is used to provide an 
estimate of best focus, and a final step selects the actual Z position giving highest pixel variance. It 
should also be noted that if at any time during the focusing process at 40x or 60x the parameters 

20 indicate that the focus position is inadequate, the system automatically reverts to a coarse focusing 
process as described above with reference to Fig. 13 A. This ensures that variations in specimen 
thickness can be accommodated in an expeditious manner. 

For some biological specimens and stains, the focusing methods discussed above do not 
provide optimal focused results. For example, certain white blood cells known as neutrophils may 

25 be stained with Fast Red, a commonly known stain, to identify alkaline phosphatase in the cytoplasm 
of the cells. To further identify these cells and the material within them, the specimen may be 
counterstained with hemotoxylin to identify the nucleus of the cells. In cells so treated, the 
cytoplasm bearing alkaline phosphatase becomes a shade of red proportionate to the amount of 
alkaline phosphatase in the cytoplasm and the nucleus becomes blue. However, where the 

30 cytoplasm and nucleus overlap, the cell appears purple. These color combinations appear to 
preclude the finding of a focused Z position using the focus processes discussed above. 

In an effort to find a best focal position at high magnification, a focus method, such as the 
one shown in Fig. 13B, may be used. That method begins by selecting a pixel near the center of a 
candidate object of interest (Block 248) and defining a region of interest centered about the selected 
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pixel (Block 250). Preferably, the width of the region of interest is a number of columns which is a 
power of 2. This width preference arises from subsequent processing of the region of interest 
preferably using a one dimensional Fast Fourier Transform (FFT) technique. As is well known 
within, the art, processing columns of pixel values using the FFT technique is facilitated by making 
the number of columns to be processed a power of two. While the height of the region of interest is 
also a power of two in the preferred embodiment, it need not be unless a two dimensional FFT 
technique is used to process the region of interest. 

After the region of interest is selected, the columns of pixel values are processed using the 
preferred one dimensional FFT to determine a spectra of frequency components for the region of 
interest (Block 252). The frequency spectra ranges from DC to some highest frequency component. 
For each frequency component, a complex magnitude is computed. Preferably, the complex 
magnitudes for the frequency components which range from approximately 25% of the highest 
component to approximately 75% of the highest component are squared and summed to determine 
the total power for the region of interest (Block 254). Alternatively, the region of interest may be 
processed with a smoothing window, such as a Hanning window, to reduce the spurious high 
frequency components generated by the FFT processing of the pixel values in the region of interest. 
Such preprocessing of the region of interest permits all complex magnitude over the complete 
frequency range to be squared and summed. After the power for a region has been computed and 
stored (Block 256), a new focal position is selected, focus adjusted (Blocks 258, 260), and the 
process repeated. After each focal position has been evaluated, the one having the greatest power 
factor is selected as the one best in focus (Block 262). 

The following describes the image processing methods which are utilized to decide whether 
a candidate object of interest such as a stained tumor cell is present in a given image, or field, during 
the scanning process. Candidate objects of interest which are detected during scanning are reimaged 
at higher (40x or 60x) magnification, the decision confirmed, and a region of interest for this cell 
saved for later review by the pathologist. 

The image processing includes color space conversion, low pass filtering, background 
suppression, artifact suppression, morphological processing, and blob analysis. One or more of these 
steps can optionally be eliminated. The operator is provided with an option to configure the system 
to perform any or all of these steps and whether to perform certain steps more than once or several 
times in a row. It should also be noted that the sequence of steps may be varied and thereby 
optimized for specific reagents or reagent combinations; however, the sequence described herein is 
preferred. It should be noted that the image processing steps of low pass filtering, thresholding, 
morphological processing, and blob analysis are generally known image processing building blocks. 
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An overview of the preferred process is shown in Fig. 17A. The preferred process for 
identifying and locating candidate objects of interest in a stained biological specimen on a slide 
begins with an acquisition of images obtained by scanning the slide at low magnification (Block 
288). Each image is then converted from a first color space to a second color space (Block 290) and 
5 the color converted image is low pass filtered (Block 292). The pixels of the low pass filtered image 
are then compared to a threshold (Block 294) and, preferably, those pixels having a value equal to or 
greater than the threshold are identified as candidate object of interest pixels and those less than the 
threshold are determined to be artifact or background pixels. The candidate object of interest pixels 
are then morphologically processed to identify groups of candidate object of interest pixels as 

10 candidate objects of interest (Block 296). These candidate objects of interest are then compared to 
blob analysis parameters (Block 298) to further differentiate candidate objects of interest from 
objects which do not conform to the blob analysis parameters and, thus, do not warrant further 
processing. The location of the candidate objects of interest may be stored prior to confirmation at 
high magnification. The process continues by determining whether the candidate objects of interest 

15 have been confirmed (Block 300). If they have not been confirmed, the optical system is set to high 
magnification (Block 302) and images of the slide at the locations corresponding to the candidate 
objects of interest identified in the low magnification images are acquired (Block 288). These 
images are then color converted (Block 290), low pass filtered (Block 292), compared to a threshold 
(Block 294), morphologically processed (Block 296), and compared to blob analysis parameters 

20 (Block 298) to confirm which candidate objects of interest located from the low magnification 
images are objects of interest. The coordinates of the objects of interest are then stored for future 
reference (Block 303). 

Neural net processing schemes were not considered for the preferred embodiment for several 
reasons. Firstly, the preferred embodiment is optimized for "rare-event" detection, although it is not 

25 limited to this case. Since neural nets must be trained on what to look for, sometimes several 
thousands Of examples must be presented to the neural net for this training. This is impractical for a 
rare-event application. Secondly, neural net processing can be slower than "deterministic" methods, 
sometimes by large factors. Therefore, neural nets were not deemed appropriate for this application, 
although certain features of the invention may be advantageously applied to neural network systems. 

30 In general, the candidate objects of interest, such as tumor cells, are detected based on a 

combination of characteristics, including size, shape, and color. The chain of decision making based 
on these characteristics preferably begins with a color space conversion process. The CCD camera 
coupled to the microscope subsystem outputs a color image comprising a matrix of 640 x 480 pixels. 
Each pixel comprises red, green and blue (RGB) signal values. 
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It is desirable to transform the matrix of RGB values to a different color space because the 
difference between candidate objects of interest and their background, such as tumor and normal 
cells, may be determined from their respective colors. Specimens are generally stained with one or 
more industry standard stains (e.g., DAB, New Fuchsin, AEC) which are "reddish" in color. 
5 Candidate objects of interest retain more of the stain and thus appear red while normal cells remain 
unstained. The specimens may also be counterstained with hematoxalin so the nuclei of normal cells 
or cells not containing an object of interest appear blue. In addition to these objects, dirt and debris 
can appear as black, gray, or can also be lightly stained red or blue depending on the staining 
procedures utilized. The residual plasma or other fluids also present on a smear may also possess 
10 some color. 

In the color conversion operation, a ratio of two of the RGB signal values is formed to 
provide a means for discriminating color information. With three signal values for each pixel, nine 
different ratios can be formed: 

R/R, R/G, R/B, G/G, G/B, G/R, B/B, B/G, B/R 

15 The optimal ratio to select depends upon the range of color information expected in the slide 
specimen. As noted above, typical stains used for detecting candidate objects of interest such as 
tumor cells are predominantly red, as opposed to predominantly green or blue. Thus, the pixels of a 
cell of interest which has been stained contain a red component which is larger than either the green 
or blue components. A ratio of red divided by blue (R/B) provides a value which is greater than one 

20 for tumor cells but is approximately one for any clear or white areas on the slide. Since the 
remaining cells, i.e., normal cells, typically are stained blue, the R/B ratio for pixels of these latter 
cells yields values of less than one. The R/B ratio is preferred for clearly separating the color 
information typical in these applications. 

Fig. 17B illustrates the flow diagram by which this conversion is performed. In the interest 

25 of processing speed, the conversion is implemented with a look up table. The use of a look up table 
for color conversion accomplishes three functions: 1) performing a division operation; 2) scaling the 
result for processing as an image having pixel values ranging from 0 to 255; and 3) defining objects 
which have low pixel values in each color band (R,G,B) as "black" to avoid infinite ratios (i.e., 
dividing by zero). These "black" objects are typically staining artifacts or can be edges of bubbles 

30 caused by pasting a coverglass over the specimen. 

Once the look up table is built at 304 for the specific color ratio (i.e., choices of tumor and 
nucleated cell stains), each pixel in the original RGB image is converted at 308 to produce the 
output. Since it is of interest to separate the red stained tumor cells from blue stained normal ones, 
the ratio of color values is then scaled by a user specified factor. As an example, for a factor of 128 
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and the ratio of (red pixel value)/(blue pixel value), clear areas on the slide would have a ratio of 1 
scaled by 128 for a final X value of 128. Pixels which lie in red stained tumor cells would have X 
value greater than 128, while blue stained nuclei of normal cells would have value less than 128. In 
this way, the desired objects of interest can be numerically discriminated. The resulting 640 x 480 
5 pixel matrix, referred to as the ^T-image, is a gray scale image having values ranging from 0 to 255. 

Other methods exist for discriminating color information. One classical method converts the 
RGB color information into another color space, such as HSI (hue, saturation, intensity) space. In 
such a space, distinctly different hues such as red, blue, green,.yellow, may be readily separated. In 
addition, relatively lightly stained objects may be distinguished from more intensely stained ones by 

10 virtue of differing saturations. However, converting from RGB space to HSI space requires more 
complex computation. Conversion to a color ratio is faster; for example, a full image can be 
converted by the ratio technique of the present invention in about 30 ms while an HSI conversion 
can take several seconds. 

In yet another approach, one could obtain color information by taking a single color channel 

15 from the camera. As an example, consider a blue channel, in which objects that are red are relatively 
dark. Objects which are blue, or white, are relatively light in the blue channel. In principle, one 
could take a single color channel, and simply set a threshold wherein everything darker than some 
threshold is categorized as a candidate object of interest, for example, a tumor cell, because it is red 
and hence dark in the channel being reviewed. However, one problem with the single channel 

20 approach occurs where illumination is not uniform. Non-uniformity of illumination results in 
non-uniformity across the pixel values in any color channel, for example, tending to peak in the 
middle of the image and dropping off at the edges where the illumination falls off. Performing 
thresholding on this non-uniform color information runs into problems, as the edges sometimes fall 
below the threshold, and therefore it becomes more difficult to pick the appropriate threshold level. 

25 However, with the ratio technique, if the values of the red channel fall off from center to edge, then 
the values of the blue channel also fall off center to edge, resulting in a uniform ratio. Thus, the ratio 
technique is more immune to illumination non-uniformities. 

As previously described, the color conversion scheme is relatively insensitive to changes in 
color balance, i.e., the relative outputs of the red, green, and blue channels. However, some control 

30 is necessary to avoid camera saturation,' or inadequate exposures in any one of the color bands. This 
color balancing is performed automatically by utilizing a calibration slide consisting of a clear area, 
and a "dark" area having a known optical transmission or density. The system obtains images from 
the clear and "dark" areas, calculates "white" and "black" adjustments for the image processor 25, 
and thereby provides correct color balance. 
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In addition to the color balance control, certain mechanical alignments are automated in this 

process. The center point in the field of view for the various microscope objectives as measured on 

the slide can vary by several (or several tens of) microns. This is the result of slight variations in 

position of the microscope objectives 44a as determined by the turret 44 (Fig. 4), small variations in 
*5 alignment of the objectives with respect to the system optical axis, and other factors. Since it is 

desired that each microscope objective be centered at the same point, these mechanical offsets must 

be measured and automatically compensated. 

This is accomplished by imaging a test slide which contains a recognizable feature or mark. 

An image of this pattern is obtained by the system with a given objective, and the position of the 
10 mark determined. The system then rotates the turret to the next lens objective, obtains an image of 

the test object, and its position is redetermined. Apparent changes in position of the test mark are 

recorded for this objective. This process is continued for all objectives.. 

Once these spatial offsets have been determined, they are automatically compensated for by 

moving the stage 38 by an equal (but opposite) amount of offset during changes in objective. In this 
15 way, as different lens objectives are selected, there is no apparent shift in center point or area 

viewed. 

A low pass filtering process precedes thresholding. An objective of thresholding is to obtain 
, a pixel image matrix having only candidate objects of interest, such as tumor cells above a threshold 
level and everything else below it. However, an actual acquired image will contain noise. The noise 
20 can take several forms, including white noise and artifacts. The microscope slide can have small 
fragments of debris that pick up color in the staining process and these are known as artifacts. These 
artifacts are generally small and scattered areas, on the order of a few pixels, which are above the 
threshold. The purpose of low pass filtering is to essentially blur or smear the entire color converted 
image. The low pass filtering process will smear artifacts more than larger objects of interest, such 
25 as tumor cells and thereby eliminate or reduce the number of artifacts that pass the thresholding 
process. The result is a cleaner thresholded image downstream. 

In the low pass filter process, a 3 x 3 matrix of coefficients is applied to each pixel in the 640 
x 480 x-image. A preferred coefficient matrix is as follows: 
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At each pixel location, a 3 x 3 matrix comprising the pixel of interest and its neighbors is multiplied by 
the coefficient matrix and summed to yield a single value for the pixel of interest. The output of this 
spatial convolution process is again a 640 x 480 matrix. 

As an example, consider a case where the center pixel and only the center pixel, has a value 
5 of 255 and each of its other neighbors, top left, top, top right and so forth, have values of 0. This 
singular white pixel case corresponds to a small object. The result of the matrix multiplication and 
addition using the coefficient matrix is a value of 1/9 (255) or 28 for the center pixel, a value which 
is below the nominal threshold of 128. Now consider another case in which all the pixels have a 
value of 255 corresponding to a large object. Performing the low pass filtering operation on a 3 x 3 

10 matrix for this case yields a value of 255 for the center pixel. Thus, large objects retain their values 
while small objects are reduced in amplitude or eliminated. In the preferred method of operation, 
the low pass filtering process is performed on the X image twice in succession. 

In order to separate objects of interest, such as a tumor cell in the x image from other objects 
and background, a thresholding operation is performed designed to set pixels within cells of interest 

15 to a value of 255, and all other areas to 0. Thresholding ideally yields an image in which cells of 
interest are white and the remainder of the image is black. A problem one faces in thresholding is 
where to set the threshold level. One cannot simply assume that cells of interest are indicated by any 
pixel value above the nominal threshold of 128. A typical imaging system may use an incandescent 
halogen light bulb as a light source. As the bulb ages, the relative amounts of red and blue output 

20 can change. The tendency as the bulb ages is for the blue to drop off more than the red and the 
green. To accommodate for this light source variation over time, a dynamic thresholding process is 
used whereby the threshold is adjusted dynamically for each acquired image. Thus, for each 640 x 
480 image, a single threshold value is derived specific to that image. 

As shown in Fig. 18, the basic method is to calculate, for each field, the mean X value, and 

25 the standard deviation about this mean at 312. The threshold is then set at 314 to the mean plus an 
amount defined by the product of a (user specified) factor and the standard deviation of the color 
converted pixel values. The standard deviation correlates to the structure and number of objects in 
the image. Preferably, the user specified factor is in the range of approximately 1.5 to 2.5. The 
factor is selected to be in the lower end of the range for slides in which the stain has primarily 

30 remained within cell boundaries and the factor is selected to be in the upper end of the range for 
slides in which the stain is pervasively present throughout the slide. In this way, as areas are 
encountered on the slide with greater or lower background intensities, the threshold may be raised or 
lowered to help reduce background objects. With this method, the threshold changes in step with the 
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aging of the light source such that the effects of the aging are cancelled out. The image matrix 
resulting at 316 from the thresholding step is a binary image of black (O) and white (255) pixels. 

As is often the case with thresholding operations such as that described above, some 
undesired areas will lie above the threshold value due to noise, small stained cell fragments, and 
5 other artifacts. It is desired and possible to eliminate these artifacts by virtue of their small size 
compared with legitimate cells of interest. Morphological processes are utilized to perform this 
function. 

Morphological processing is similar to the low pass filter convolution process described 
earlier except that it is applied to a binary image. Similar to spatial convolution, the morphological 
10 process traverses an input image matrix, pixel by pixel, and places the processed pixels in an output 
matrix. Rather than calculating a weighted sum of neighboring pixels as in the low pass convolution 
process, the morphological process uses set theory operations to combine neighboring pixels in a 
nonlinear fashion. 

Erosion is a process whereby a single pixel layer is taken away from the edge of an object. 

1 5 Dilation is the opposite process which adds a single pixel layer to the edges of an object. The power 
of morphological processing is that it provides for further discrimination to eliminate small objects 
that have survived the thresholding process and yet are not likely tumor cells. The erosion and 
dilation processes that make up a morphological "open" preferably make small objects disappear yet 
allows large objects to remain. Morphological processing of binary images is described in detail in 

20 "Digital Image Processing", pages 127-137, G.A. Baxes, John Wiley & Sons, (1994). 

Fig. 19 illustrates the flow diagram for this process. As shown here, a morphological "open" 
process performs this suppression. A single morphological open consists of a single morphological 
erosion 320 followed by a single morphological dilation 322. Multiple "opens" consist of multiple 
erosions followed by multiple dilations. In the preferred embodiment, one or two morphological 

25 opens are found to be suitable. 

At this point in the processing chain, the processed image contains thresholded objects of 
interest, such as tumor cells (if any were present in the original image), and possibly some residual 
artifacts that were too large to be eliminated by the processes above. 

Fig. 20 provides a flow diagram illustrating a blob analysis performed to determine the 
30 number, size, and location of objects in the thresholded image. A blob is defined as a region of 
connected pixels having the same "color", in this case, a value of 255. Processing is performed over 
the entire image to determine the number of such regions at 324 and to determine the area and x,y 
coordinates for each detected blob at 326. 
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Comparison of the size of each blob to a known minimum area at 328 for a tumor cell allows 
a refinement in decisions about which objects are objects of interest, such as tumor cells, and which 
are artifacts. The location (x,y coordinates) of objects identified as cells of interest in this stage are 
saved for the final 40x reimaging step described below. Objects not passing the size test are 
5 disregarded as artifacts. 

The processing chain described above identifies objects at the scanning magnification as 
cells of interest candidates. As illustrated in Fig. 21, at the completion of scanning, the system 
switches to the 40x magnification objective at 330, and each candidate is reimaged to confirm the 
identification 332. Each 40x image is reprocessed at 334 using the same steps as described above 
10 but with test parameters suitably modified for the higher magnification (e.g. area). At 336, a region 
of interest centered on each confirmed cell is saved to the hard drive for review by the pathologist. 

As noted earlier, a mosaic of saved images is made available for viewing by the pathologist. 
As shown in Fig. 22, a series of images of cells which have been confirmed by the image analysis is 
presented in the mosaic 150. The pathologist can then visually inspect the images to make a 
1 5 determination whether to accept (1 52) or reject (1 53) each cell image. Such a determination can be 
noted and saved with the mosaic of images for generating a printed report. 

In addition to saving the image of the cell and its region, the cell coordinates are saved 
should the pathologist wish to directly view the cell through the oculars or on the image monitor. In 
this case, the pathologist reloads the slide carrier, selects the slide and cell for review from a mosaic 
20 of cell images, and the system automatically positions the cell under the microscope for viewing. 

It has been found that normal cells whose nuclei have been stained with hematoxylin are 
often quite numerous, numbering in the thousands per lOx image. Since these cells are so numerous, 
and since they tend to clump, counting each individual nucleated cell would add an excessive 
processing burden, at the expense of speed, and would not necessarily provide an accurate count due 
25 to clumping, The apparatus performs an estimation process in which the total area of each field that 
is stained hematoxylin blue is measured and this area is divided by the average size of a nucleated 
cell. Fig. 23 outlines this process. 

In this process, a single color band (the red channel provides the best contrast for blue 
stained nucleated cells) is processed by calculating the average pixel value for each field at 342, 
30 establishing two threshold values (high and low) as indicated at 344, 346, and counting the number 
of pixels between these two values at 348. In the absence of dirt, or other opaque debris, this 
provides a count of the number of predominantly blue pixels. By dividing this value by the average 
area for a nucleated cell at 350, and looping over all fields at 352, an approximate cell count is 
obtained. Preliminary testing of this process indicates an accuracy with +/- 15%. It should be noted 
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that for some slide preparation techniques, the size of nucleated cells can be significantly larger than 

the typical size. The operator can select the appropriate nucleated cell size to compensate for these 
characteristics. 

As with any imaging system, there is some loss of modulation transfer (i.e. contrast) due to 
the modulation transfer function (MTF) characteristics of the imaging optics, camera, electronics, 
and other components. Since it is desired to save "high quality" images of cells of interest both for 
pathologist review and for archival purposes, it is desired to compensate for these MTF losses. 

An MTF compensation, or MTFC, is performed as a digital process applied to the acquired 
digital images. A digital filter is utilized to restore the high spatial frequency content of the images 
upon storage, while maintaining low noise levels. With this MTFC technology, image quality is 
enhanced, or restored, through the use of digital processing methods as opposed to conventional 
oil-immersion or other hardware based methods. MTFC is described further in "The Image 
Processing Handbook," pages 225 and 337, J. C. Rues, CRC Press (1995). 

Referring to Fig. 24, the functions available in a user interface of the apparatus 10 are shown. 
From the user interface, which is presented graphically on computer monitor 26, an operator can 
select among apparatus functions which include acquisition 402, analysts 404, and system 
configuration 406. At the acquisition level 402, the operator can select between manual 408 and 
automatic 410 modes of operation. In the manual mode, the operator is presented with manual 
operations 409. Patient information 414 regarding an assay can be entered at 412. 

In the analysis level 404, review 416 and report 418 functions are made available. At the 
review level 416, the operator can select a montage function 420. At this montage level, a 
pathologist can perform diagnostic review functions including visiting an image 422, accept/reject of 
cells 424, nucleated cell counting 426, accept/reject of cell counts 428, and saving of pages at 430. 
The report level 418 allows an operator to generate patient reports 432. 

In the configuration level 406, the operator can select to configure preferences at 434, input 
operator information 437 at 436, create a system log at 438, and toggle a menu panel at 440. The 
configuration preferences include scan area selection functions at 442, 452; montage specifications 
at 444, bar code handling at 446, default cell counting at 448, stain selection at 450, and scan 
objective selection at 454. 

Equivalents 

While this invention has been particularly shown and described with references to preferred 
embodiments thereof, it will be understood by those skilled in the art that various changes in form 



23 



WO 97/20198 PCT/US96/19166 

and details may be made therein without departing from the spirit and scope of the invention 
defined by the appended claims. 
What is claimed is: 
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CLAIMS 

1. A method of automatic analysis of an image of a magnified area of a slide having a 
biological specimen prepared with a reagent comprising the steps of: 

transforming pixels of an image in a first color space to a second color space to 
differentiate candidate object of interest pixels from background pixels; and 

identifying candidate objects of interest from the candidate object of interest pixels in the 
second color space. 

2. The method of Claim 1 wherein the first color space includes red, green, and blue 
components for each pixel and the transforming step includes forming a ratio between two components 
of the red, blue and green components for each pixel in the first color space to transform the pixels to the 
second color space. 

3. The method of Claim 2 further comprising the step of: 

selecting a grayscale value for each pixel in the second color space which corresponds to 
the ratio of components in the first color space. 

4. The method of Claim 1 wherein the first color space includes red, green, and blue 
components for each pixel and the transforming step includes converting components of the red, blue and 
green components for each pixel in the first color space to pixel values in a hue, saturation, and intensity 
space. 

5. The method of Claim 1 wherein the first color space includes red, green, and blue 
components for each pixel and the transforming step includes comparing pixel values for a single 

-25- 



WO 97/20 1 98 PCT/US96/1 91 66 

component for each pixel to a threshold to identify pixels having a component value equal to or greater 

than said threshold as candidate object of interests pixels and pixels having a component value less than 

the threshold as background pixels. 



6. The method of claim 1 further comprising the steps of: 

morphologically processing the candidate object of interest pixels to identify artifact 

pixels; and 

identifying the candidate objects of interest from the remaining candidate object of 
interest pixels not identified as artifact pixels. 



7. The method of claim 6 further comprising the steps of: 

filtering said candidate object of interest pixels with a low pass filter prior to 
morphologically processing the low pass filtered candidate object of interest pixels. 



8. The method of claim 7 further comprising the steps of: 

comparing said low passed filtered candidate object of interest pixels to a threshold prior 
to morphologically processing the candidate object of interest pixels which have values greater than or 
equal to the threshold value. 



9. The method of claim 8 further comprising the steps of: 

computing a mean value of said candidate object of interest pixels; 
specifying a threshold factor; 

computing a standard deviation for the candidate object of interest pixels; and 
setting the threshold to the sum of the mean value and the product of the threshold factor 
and the standard deviation prior to comparing the candidate object of interest pixels to the threshold. 
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10. The method of claim 7 the identifying further comprising the steps of: 

grouping said morphologically processed candidate object of interest pixels into regions 
of connected candidate object of interest pixels to identify objects of interest; 
5 comparing said objects of interest to blob analysis parameters; and 

storing location coordinates of the candidate objects of interest having an area 
corresponding to the blob analysis parameters. 



1 1 . The method of claim 10 wherein said previously performed method steps are performed 
10 on images acquired at low magnification and the method further comprising the steps of: 

adjusting an optical system viewing the slide from which the objects of interest were 
identified to high magnification; 

acquiring a high magnification image of the slide at the corresponding location 
coordinates for each candidate object of interest; 
15 transforming pixels of the high magnification image in the first color space to a second 

color space to differentiate high magnification candidate objects of interest pixels from background 
pixels; and 

identifying high magnification objects of interest from the candidate object of interest 
pixels in the second color space. 

20 

12. The method of claim 1 1 further comprising the steps of: 

morphologically processing the high magnification candidate object of interest pixels to 
identify artifact pixels; and 

identifying the high magnification objects of interest from the remaining high 
25 magnification candidate object of interest pixels not identified as artifact pixels. 
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13. The method of claim 12 further comprising the steps of: 

filtering said high magnification candidate object of interest pixels with a low pass filter 
prior to morphologically processing the low pass filtered high magnification candidate object of interest 
5 pixels. 



14. The method of claim 13 further comprising the steps of: 

comparing said low passed filtered high magnification candidate object of interest pixels 
to a threshold prior to morphologically processing the high magnification candidate object of interest 
10 pixels which have values greater than or equal to the threshold value. 



1 5. The method of claim 14 further comprising the steps of: 

computing a mean value of said high magnification candidate object of interest pixels; 
specifying a threshold factor; 
15 computing a standard deviation for the high magnification candidate object of interest 

pixels; and 

setting the threshold to the sum of the mean value and the product of the threshold factor 
and the standard deviation prior to comparing the high magnification candidate object of interest pixels 
to the threshold. 

20 

16. The method of claim 15 further comprising the steps of: 

grouping said low passed filtered high magnification candidate object of interest pixels 
into regions of connected high magnification candidate object of interest pixels to identify high 
magnification objects of interest; 

25 comparing said high magnification objects of interest to blob analysis parameters; and 

« 
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storing location coordinates of the high magnification objects of interest corresponding 

to the blob analysis parameters. 



17. The method of claim 11 wherein an optical system is initially focused prior to 
5 performing the low magnification steps. 



18. The method of claim 17 wherein the initial focusing of the optical system prior to 
performing the low magnification steps, further comprises the steps of: 

(a) positioning the optical system at an initial Z stage position; 
10 (b) acquiring at low magnification an image of a slide having a stained biological 

specimen thereon and calculating a pixel variance about a pixel mean for the acquired image; 

(c) incrementing the position of the Z stage; 

(d) repeating steps (b) and (c) for a fixed number of coarse iterations to form a first 
set of variance data; 

1 5 (e) performing a least squares fit of the first set of variance data to a first function; 

(0 positioning the Z stage at a position near the peak of the first function; 
(g) repeating steps (b) and (c) for a fixed number of fine iterations to form a second 
set of variance data; 



(h) performing a least squares fit of the second set of variance data to a second 



20 function; 



(0 



selecting the peak value of the least squares fit curve as an estimate of the best 



focal position; and 



(j) performing the above steps for an array of X-Y stage positions to form an array 
of focal positions and performing a least squares fit of the array of focal positions to yield a least squares 
25 fit focal plane. 
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19. The method of claim 17 wherein the initial focusing of the optical system prior to 

performing the low magnification steps, further comprises the steps of; 

(a) positioning the optical system at an initial Z stage position; 

(b) acquiring an image and calculating a pixel variance about a pixel mean for the 

acquired image; 

(c) incrementing the position of the Z stage; 

(d) repeating steps (b) and (c) for a fixed number of iterations; 

(e) performing a least squares fit of the variance data to a known function; and 

(f) selecting the peak value of the least squares fit curve as an estimate of the best 

focal position. 



20. The method of claim 1 1 the adjusting optical system step further comprising the steps of: 

(a) positioning the optical system at an initial Z stage position; 

(b) acquiring an image and selecting a center pixel of a candidate object of interest; 

(c) defining a region of interest centered about the selected center pixel; 

(d) performing a fast fourier transform of said region of interest to identify 
frequency components for the region of interest and complex magnitudes for the frequency components; 

(e) computing a power value by summing the square of the complex magnitudes for 
the frequency components that are within the range of frequencies of 25% to 75% of a maximum 
frequency component for the fast fourier transform of the region of interest; 

(0 incrementing the position of the Z stage; 

(g) repeating steps (b) - (e) for a fixed number of iterations; and 

(h) selecting the Z stage position corresponding to the largest power value as the 
best focal position. 



21. The method of claim 1 1 the adjusting optical system step further comprising the steps of: 
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(a) positioning the optical system at an initial Z stage position; 

(b) acquiring an image and selecting a center pixel of a candidate object of interest; 

(c) defining a region of interest centered about the selected center pixel; 

(d) applying a Hanning window function to the region of interest; 

5 (d) performing a fast fourier transform of said region of interest following the 

application of the Hanning window function to identify frequency components for the region of interest 
and complex magnitudes for the frequency components; 

(e) computing a power value by summing the square of the complex magnitudes for 
the frequency components for the fast fourier transform of the region of interest; 

10 (f) incrementing the position of the Z stage; 

(g) repeating steps (b) - (e) for a fixed number of iterations; and 

(h) selecting the Z stage position corresponding to the largest power value as the 
best focal position. 

22. A method for initially focusing an optical system of an automated microscopy system 

comprising the steps of: 

(a) positioning an optical system at an initial Z stage position; 

(b) acquiring an image and calculating a pixel variance about a pixel mean for the 

acquired image; 

(c) incrementing the position of the Z stage; 

(d) repeating steps (b) and (c) for a fixed number of iterations; 

(e) performing a least squares fit of the variance data to a known function; and 

(f) selecting the peak value of the least squares fit curve as an estimate of the best 

focal position. 



15 



20 



25 
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23. A method for initially focusing an optical system of an automated microscopy system 

comprising the steps of: 

(a) positioning an optical system at an initial Z stage position; 

(b) acquiring at low magnification an image of a slide having a stained biological 
5 specimen thereon and calculating a pixel variance about a pixel mean for the acquired image; 

(c) incrementing the position of the Z stage; 

(d) repeating steps (b) and (c) for a fixed number of coarse iterations to form a first 
set of variance data; 

(e) performing a least squares fit of the first set of variance data to a first function; 



10 



(f) positioning the Z stage at a position near the peak of the first function; 



(g) repeating steps (b) and (c) for a fixed number of fine iterations to form a second 



set of variance data; 



(h) performing a least squares fit of the second set of variance data to a second 



function; 



15 



(i) selecting the peak value of the least squares fit curve as an estimate of the best 



20 



focal position; and 

(j) performing the above steps for an array of X-Y stage positions to form an array 
of focal positions and performing a least squares fit of the array of focal positions to yield a least squares 
fit focal plane. 



24. A method for focusing an optical system of an automated microscopy system at high 
magnification comprising the steps of: 

(a) positioning an optical system at an initial Z stage position; 

(b) acquiring an image and selecting a center pixel of a candidate object of interest; 



25 



(c) defining a region of interest centered about the selected center pixel; 



-32- 



WO 97/20198 PCT/US96/19166 

(d) performing a fast fourier transform of said region of interest to identify 

frequency components for the region of interest and complex magnitudes for the frequency components; 

(e) computing a power value by summing the square of the complex magnitudes for 
the frequency components that are within the range of frequencies of 25% to 75% of a maximum 
frequency component for the fast fourier transform of the region of interest; 

(f) incrementing the position of the Z stage; 

(g) repeating steps (b) - (e) for a fixed number of iterations; and 

(h) selecting the Z stage position corresponding to the largest power value as the 
best focal position. 



25. A method for focusing an optical system of an automated microscopy system at high 
magnification comprising the steps of: 

(a) positioning an optical system at an initial Z stage position; 

(b) acquiring an image and selecting a center pixel of a candidate object of interest; 

(c) defining a region of interest centered about the selected center pixel; 

(d) applying a Hanning window function to the region of interest; 

(d) performing a fast fourier transform of said region of interest following the 
application of the Hanning window function to identify frequency components for the region of interest 
and complex magnitudes for the frequency components; 

(e) computing a power value by summing the square of the complex magnitudes for 
the frequency components for the fast fourier transform of the region of interest; 

(f) incrementing the position of the Z stage; 

(g) repeating steps (b) - (e) for a fixed number of iterations; and 

(h) selecting the Z stage position corresponding to the largest power value as the 
best focal position. 
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26. Apparatus for automatic image analysis of a slide having a biological specimen, 



comprising: 



an optical system having an X-Y stage; 

means for scanning over a scan area of the slide at a plurality of locations at low 



5 magnification of the optical system; 



interest; 



means for acquiring a low magnification image at each location in the scan area; 

a processor for processing each low magnification image to detect candidate objects of 



10 



means for storing X-Y coordinates of each location for each candidate object of interest; 
means for adjusting the optical system to a high magnification; 

means for repositioning the X-Y stage to the location for each candidate object of 



15 



interest; 



comprising: 



means for acquiring a high magnification image of each candidate object of interest; and 
a storage device for storing each high magnification image. 



27. Apparatus for automatic image analysis of a slide having a biological specimen, 



an optical system having an X-Y stage; 

means for scanning over a scan area of the slide at a plurality of locations at low 



20 magnification of the optical system; 



means for acquiring a low magnification image at each location in the scan area; 

a processor for processing each low magnification image to detect candidate objects of 



interest; 



25 



means for storing X-Y coordinates of each location for each candidate object of interest; 
means for adjusting the optical system to a high magnification; 



-34- 



WO 97/20198 PCT/US96/19166 

means for repositioning the X-Y stage to the location for each candidate object of 

interest; 

means for acquiring a high magnification image of each candidate object of interest; and 
a storage device for storing each high magnification image. 
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